智能论文笔记

Extraction of Medication Names from Twitter Using Augmentation and an Ensemble of Language Models

Igor Kulev , Berkay Köprü , Raul Rodriguez-Esteban , Diego Saldana , Yi Huang , Alessandro La Torraca , Elif Ozkirimli

分类：自然语言处理 | 机器学习

2021-11-12

生物重建VII轨道3挑战重点是在Twitter用户时间表中识别药物名称。对于我们提交这一挑战，我们通过使用多种数据增强技术扩展了可用的培训数据。然后，增强数据用于微调在一般域推特内容上预先培训的语言模型的集合。拟议的方法优于先前的最先进的算法Kusuri，并在竞争中排名高，为我们所选择的客观函数重叠F1分数。

translated by 谷歌翻译

Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

分类：计算机视觉 | 机器学习

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.

translated by 谷歌翻译

Uncovering the Dark Side of Telegram: Fakes, Clones, Scams, and Conspiracy Movements

Massimo La Morgia , Alessandro Mei , Alberto Maria Mongardini , Jie Wu

分类：机器学习

2021-11-26

电报是全球最常用的即时消息传递应用之一。其成功之所以在于提供高隐私保护和社交网络，如频道 - 虚拟房间，其中只有管理员可以发布和广播到所有订户的消息。然而，这些相同的功能促成了边界活动的出现，并且与在线社交网络一样常见，假账户的沉重存在。通过引入频道的验证和诈骗标记，电报开始解决这些问题。不幸的是，问题远未解决。在这项工作中，我们通过收集35,382个不同的渠道和超过130,000,000消息来进行大规模分析电报。我们研究电报标记为验证或骗局的渠道，突出显示类比和差异。然后，我们转到未标记的频道。在这里，我们发现一些臭名昭着的活动也存在于虚拟网络的隐私保存服务，例如梳理，共享非法成人和版权保护内容。此外，我们还确定并分析了另外两种类型的渠道：克隆和假货。克隆是发布另一个频道确切内容的频道，以获得订阅者和促进服务。相反，假货是试图冒充名人或知名服务的渠道。即使是最先进的用户甚至很难确定。要自动检测假频道，我们提出了一种机器学习模型，可以以86％的准确性识别它们。最后，我们研究了Sabmyk，这是一种阴谋理论，即利用假货和克隆在达到超过1000万用户的平台上迅速传播。

translated by 谷歌翻译

Improving Performance in Neural Networks by Dendrites-Activated Connections

Carlo Metta , Marco Fantozzi , Andrea Papini , Gianluca Amato , Matteo Bergamaschi , Silvia Giulia Galfrè , Alessandro Marchetti , Michelangelo Vegliò , Maurizio Parton , Francesco Morandin

分类：神经与进化计算 | 机器学习

2023-01-03

Computational units in artificial neural networks follow a simplified model of biological neurons. In the biological model, the output signal of a neuron runs down the axon, splits following the many branches at its end, and passes identically to all the downward neurons of the network. Each of the downward neurons will use their copy of this signal as one of many inputs dendrites, integrate them all and fire an output, if above some threshold. In the artificial neural network, this translates to the fact that the nonlinear filtering of the signal is performed in the upward neuron, meaning that in practice the same activation is shared between all the downward neurons that use that signal as their input. Dendrites thus play a passive role. We propose a slightly more complex model for the biological neuron, where dendrites play an active role: the activation in the output of the upward neuron becomes optional, and instead the signals going through each dendrite undergo independent nonlinear filterings, before the linear combination. We implement this new model into a ReLU computational unit and discuss its biological plausibility. We compare this new computational unit with the standard one and describe it from a geometrical point of view. We provide a Keras implementation of this unit into fully connected and convolutional layers and estimate their FLOPs and weights change. We then use these layers in ResNet architectures on CIFAR-10, CIFAR-100, Imagenette, and Imagewoof, obtaining performance improvements over standard ResNets up to 1.73%. Finally, we prove a universal representation theorem for continuous functions on compact sets and show that this new unit has more representational power than its standard counterpart.

translated by 谷歌翻译

Fruit Ripeness Classification: a Survey

Matteo Rizzo , Matteo Marcuzzo , Alessandro Zangari , Andrea Gasparetto , Andrea Albarelli

分类：计算机视觉 | 机器学习

2022-12-29

Fruit is a key crop in worldwide agriculture feeding millions of people. The standard supply chain of fruit products involves quality checks to guarantee freshness, taste, and, most of all, safety. An important factor that determines fruit quality is its stage of ripening. This is usually manually classified by experts in the field, which makes it a labor-intensive and error-prone process. Thus, there is an arising need for automation in the process of fruit ripeness classification. Many automatic methods have been proposed that employ a variety of feature descriptors for the food item to be graded. Machine learning and deep learning techniques dominate the top-performing methods. Furthermore, deep learning can operate on raw data and thus relieve the users from having to compute complex engineered features, which are often crop-specific. In this survey, we review the latest methods proposed in the literature to automatize fruit ripeness classification, highlighting the most common feature descriptors they operate on.

translated by 谷歌翻译

Persistence-based operators in machine learning

Mattia G. Bergomi , Massimo Ferri , Alessandro Mella , Pietro Vertechi

分类：机器学习

2022-12-28

Artificial neural networks can learn complex, salient data features to achieve a given task. On the opposite end of the spectrum, mathematically grounded methods such as topological data analysis allow users to design analysis pipelines fully aware of data constraints and symmetries. We introduce a class of persistence-based neural network layers. Persistence-based layers allow the users to easily inject knowledge about symmetries (equivariance) respected by the data, are equipped with learnable weights, and can be composed with state-of-the-art neural architectures.

translated by 谷歌翻译

Lexicographic Multi-Objective Reinforcement Learning

Joar Skalse , Lewis Hammond , Charlie Griffin , Alessandro Abate

分类：机器学习

2022-12-28

In this work we introduce reinforcement learning techniques for solving lexicographic multi-objective problems. These are problems that involve multiple reward signals, and where the goal is to learn a policy that maximises the first reward signal, and subject to this constraint also maximises the second reward signal, and so on. We present a family of both action-value and policy gradient algorithms that can be used to solve such problems, and prove that they converge to policies that are lexicographically optimal. We evaluate the scalability and performance of these algorithms empirically, demonstrating their practical applicability. As a more specific application, we show how our algorithms can be used to impose safety constraints on the behaviour of an agent, and compare their performance in this context with that of other constrained reinforcement learning algorithms.

translated by 谷歌翻译

On the Complexity of Representation Learning in Contextual Linear Bandits

Andrea Tirinzoni , Matteo Pirotta , Alessandro Lazaric

分类：机器学习 | (统计)机器学习

2022-12-19

In contextual linear bandits, the reward function is assumed to be a linear combination of an unknown reward vector and a given embedding of context-arm pairs. In practice, the embedding is often learned at the same time as the reward vector, thus leading to an online representation learning problem. Existing approaches to representation learning in contextual bandits are either very generic (e.g., model-selection techniques or algorithms for learning with arbitrary function classes) or specialized to particular structures (e.g., nested features or representations with certain spectral properties). As a result, the understanding of the cost of representation learning in contextual linear bandit is still limited. In this paper, we take a systematic approach to the problem and provide a comprehensive study through an instance-dependent perspective. We show that representation learning is fundamentally more complex than linear bandits (i.e., learning with a given representation). In particular, learning with a given set of representations is never simpler than learning with the worst realizable representation in the set, while we show cases where it can be arbitrarily harder. We complement this result with an extensive discussion of how it relates to existing literature and we illustrate positive instances where representation learning is as complex as learning with a fixed representation and where sub-logarithmic regret is achievable.

translated by 谷歌翻译

Enriching Relation Extraction with OpenIE

Alessandro Temperoni , Maria Biryukov , Martin Theobald

分类：自然语言处理 | 机器学习

2022-12-19

Relation extraction (RE) is a sub-discipline of information extraction (IE) which focuses on the prediction of a relational predicate from a natural-language input unit (such as a sentence, a clause, or even a short paragraph consisting of multiple sentences and/or clauses). Together with named-entity recognition (NER) and disambiguation (NED), RE forms the basis for many advanced IE tasks such as knowledge-base (KB) population and verification. In this work, we explore how recent approaches for open information extraction (OpenIE) may help to improve the task of RE by encoding structured information about the sentences' principal units, such as subjects, objects, verbal phrases, and adverbials, into various forms of vectorized (and hence unstructured) representations of the sentences. Our main conjecture is that the decomposition of long and possibly convoluted sentences into multiple smaller clauses via OpenIE even helps to fine-tune context-sensitive language models such as BERT (and its plethora of variants) for RE. Our experiments over two annotated corpora, KnowledgeNet and FewRel, demonstrate the improved accuracy of our enriched models compared to existing RE approaches. Our best results reach 92% and 71% of F1 score for KnowledgeNet and FewRel, respectively, proving the effectiveness of our approach on competitive benchmarks.

translated by 谷歌翻译

Quantum Clustering with k-Means: a Hybrid Approach

Alessandro Poggiali , Alessandro Berti , Anna Bernasconi , Gianna Del Corso , Riccardo Guidotti

分类：机器学习

2022-12-13

Quantum computing is a promising paradigm based on quantum theory for performing fast computations. Quantum algorithms are expected to surpass their classical counterparts in terms of computational complexity for certain tasks, including machine learning. In this paper, we design, implement, and evaluate three hybrid quantum k-Means algorithms, exploiting different degree of parallelism. Indeed, each algorithm incrementally leverages quantum parallelism to reduce the complexity of the cluster assignment step up to a constant cost. In particular, we exploit quantum phenomena to speed up the computation of distances. The core idea is that the computation of distances between records and centroids can be executed simultaneously, thus saving time, especially for big datasets. We show that our hybrid quantum k-Means algorithms can be more efficient than the classical version, still obtaining comparable clustering results.

translated by 谷歌翻译